Skip to main content
Search roles

Data Science / AI Intern – Literature Mining & Graph Modeling

Location Waltham, Massachusetts, United States Job ID R-244721 Date posted 27/01/2026

AstraZeneca is seeking Master’s and PhD students studying Biology, Computer Science, Chemistry, Physics, Engineering, Biomedical Science, Pharmacology, Data Science, Bioinformatics, or a related discipline for a 10-week internship role at our site in Waltham, MA from June 01, 2026- August 07, 2026.  This internship sits at the intersection of data engineering, biomedical NLP, and translational science, enabling faster insight generation for R&D teams. 

Position Description:

  • Build an end-to-end pipeline turning literature (papers, abstracts, patents) into a standardized knowledge graph with contextualized evidence.
  • Handle source selection, inclusion/exclusion criteria, updates, and data snapshots.
  • Develop NLP for entity recognition, relation extraction, assertion detection, and context tagging (drug, indication, resistance, biomarker, outcome).
  • Encode domain relations (e.g., Drug–mechanism→Gene/Pathway; Biomarker–modulates→Outcome; ADC–targets→Antigen).
  • Map entities to controlled vocabularies; manage synonyms, disambiguation, and canonical IDs.
  • Implement edge-level confidence scoring (source quality, claim type, co-occurrence, citations, model certainty) with full evidence provenance.
  • Build graph storage (property graph or RDF) and queryable APIs.
  • Deliver interactive visualization (UI or notebook) with filters, context toggles, and evidence drill-down.
  • Define metrics, run error analyses, and validate with scientific stakeholders.
  • Ensure reproducibility and documentation: version models/data; record architecture, assumptions, benchmarks; provide user guides.
  • Present outcomes to data science, oncology, and translational medicine teams.

Position Requirements:      

  • Master’s and PhD students studying Biology, Computer Science, Chemistry, Physics, Engineering, Biomedical Science, Pharmacology, Data Science, Bioinformatics, or a related discipline.
  • Candidates must have an expected graduation date after August 2026.
  • US Work Authorization is required at time of application.
  • This role will not be providing OPT support.
  • NLP and ML: NER, relation extraction, transformers; Python-based workflows.
  • Graph/data modeling: experience with Neo4j, NetworkX, or RDF/SPARQL.
  • Domain knowledge: genes, pathways, biomarkers, therapeutic modalities (incl. ADCs) preferred.
  • Reproducibility: version control, environment management, documentation.
  • Soft skills: problem-solving, communication, collaboration.
  • Tech stack: Python (spaCy, Hugging Face), scikit-learn; PyTorch or TensorFlow.
  • Data & viz: pandas; PySpark or Dask; Plotly/Dash, D3.js, Neo4j Bloom.
  • Dev practices: Git, Conda/Poetry, Docker, experiment tracking.
  • Ability to report onsite to Waltham, MA site 3-5 days per week.
  • This role will not provide relocation assistance.
  • Compensation range: $41-$48 per hour

Date Posted

28-Jan-2026

Closing Date

12-Feb-2026

Our mission is to build an inclusive environment where equal employment opportunities are available to all applicants and employees. In furtherance of that mission, we welcome and consider applications from all qualified candidates, regardless of their protected characteristics. If you have a disability or special need that requires accommodation, please complete the corresponding section in the application form.



AstraZeneca embraces diversity and equality of opportunity. We are committed to building an inclusive and diverse team representing all backgrounds, with as wide a range of perspectives as possible, and harnessing industry-leading skills. We believe that the more inclusive we are, the better our work will be. We welcome and consider applications to join our team from all qualified candidates, regardless of their characteristics. We comply with all applicable laws and regulations on non-discrimination in employment (and recruitment), as well as work authorisation and employment eligibility verification requirements.

Join our Talent Network

Be the first to receive job updates and news from AstraZeneca

Sign up
Glassdoor logo Rated four stars on Glassdoor

Great culture, great work assignments, supportive management. Rotation opportunity within the company. They value inclusion and diversity.